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SPECIFICATION 
THERMOPHILIC ENZYMES HAVING 0 - GLYCOSIDASE ACTIVITY 

BACKGROUND OF THE INVENTION 

The present invention relates to a thermophilic enzyme 
having ^ -glycosidase activity. More particularly, the 
present invention relates to a thermophilic enzyme having 

0 -glycosidase activity derived from a hyper thermophi 1 i c 
bacterium belonging to the genus Pyrococcus. 

0 - Glycosidases are useful for hydrolysis of 
saccharides, DNA sequencing, conformational analysis of 
glycoproteins and glycol ipi ds , and enzymatic synthesis of 
oligosaccharides and he t e r o s a c char i de s with high optical 
purities. The catalytic reaction of 0 -glycosidases with 
substrates is specific with respect to the types of the 
monosaccharides constituting the substrates, and the 
optical isomerism and the position of the glycosidic 
linkage to be cleaved in the substrates. 0 -Glycosidases 
are also useful for the modification of sugar chains and 
the synthesis of oligosaccharides and polysaccharides 
retaining their optical stereoisomerism, as well as the 
synthesis of he terosacchar ides (e.g., biosur f ac tant s ) due 
to their ability to transfer a glycoside group into a 
primary, secondary or tertiary alcohol . Hitherto, various 
types of 0 -glycosidases with different substrate- 
specificities have been found in bacteria and plants. 
However, since many of such 0 -glycosidases are derived 
from mesophilic organisms, they are poor in thermal 
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resistance, and consequently are unsuitable for use in 
synthetic reactions under such extreme conditions that 
organic solvents are used simultaneously. 

If a thermophilic /3 -glycosidase active in organic 
solvents is found, this can be used as an biocatalyst to 
develop a new procedure for synthesizing a 
he terosacchar ide with high optical purity. In this 
procedure, the reverse hydrolytic reaction (i.e., 
synthetic reaction) is utilized which predominately occurs 
in the presence of an organic solvent. Under the 
circumstances, a novel & -glycosidase which is active under 
extreme conditions has been strongly demanded. 

SUMMARY OF THE INVENTION 

An object of the present invention is to provide a 
thermophilic enzyme with /3 -glycosidase activity. 

For solving the above-mentioned problems, the present 
inventors focused on hyper thermophi 1 i c bacteria capable 
of growing within the temperature range from 90 to 100° 
C. As a result, they have found a gene that is assumed to 
encode a protein having jS -glycosidase activity from its 
nucleotide sequence. The inventors have succeeded in the 
production of an enzyme from the gene by introducing the 
gene into Escherichia coli cells to transform the cells 
and then producing the enzyme from the t r an s f o rman t s , which 
enzyme was confirmed to be stable at high temperatures (90° 
C or higher) and to have 0 -glycosidase activity. This 
success leads the accomplishment of the invention. 

That is, the present invention provides a thermophilic 



enzyme having 0 -glycosidase activity which comprises the 
amino acid sequence of SEQ ID NO: 2 in which one or a 
plurality of amino acid residues may be deleted, replaced 
or added. The number of the amino acid residue which may 
be deleted, replaced or added in the amino acid sequence 
of SEQ ID NO: 2 is not particularly limited as long as the 
$ -glycosidase activity is retained, but preferably from 
1 to 30, and more preferably from 1 to 18. It is preferable 
to delete, replace or add an amino acid residue or residues 
present in any of the regions of amino acid residues 7 8 - 86, 
154-171 and 1-423. The enzyme preferably has an optimum 
temperature of 100° C or higher. 

The present invention also provides a DNA which is 
capable of hybridizing to the nucleotide sequence of SEQ 
ID NO: 1 or to the complement thereof under such conditions 
that the hybridization is carried out in 6xSSC and 50% 
formamide at 42 °C and the washing process is carried out 
in 6xSSC and 40% formamide at 25 °C , and which encodes a 
thermophilic enzyme having j3 -glycosidase activity. These 
conditions are of low stringent. A moderate stringent 
conditions are such that the hybridization is carried out 
in 6xSSC and 40% formamide at 42 °C and the washing process 
is carried out in lxSSC and 0% formamide at 5 5 °C . A high 
stringent conditions are such that the hybridization is 
carried out in 6xSSC and 30% formamide at 42 C and the 
washing process is carried out in O.lxSSC and 0% formamide 
at 62 °C . The DNA may encode a thermophilic enzyme which 
comprises the amino acid sequence of SEQ ID NO: 2 in which 
of one or a plurality of amino acid residues may be deleted, 



replaced or added and which has /? - glycosidase activity. 

The present invention further provides a recombinant 
vector containing the DNA therein, a host cell transformed 
with the recombinant vector, and a process for producing 
the enzyme comprising culturing a host cell transformed 
with an expression vector containing a DNA encoding the 
enzyme and then collecting the enzyme from the resultant 
culture. Using this process, the mass production of the 
enzyme becomes possible. 

The present invention further provides a process for 
the hydrolysis of a P -glycoside having a long alkyl chain 
at the reducing end, with a thermophilic enzyme having 0 
-glycosidase activity which comprises the amino acid 
sequence of SEQ ID NO: 2 in which one or a plurality of 
amino acid residues may be deleted, replaced or added. The 
long alkyl chain may be an alkyl group having carbon atoms 
of 8 or.more. The hydrolysis may be carried out at a 
temperature of 85° C or higher, and preferably 100° C or 
higher . 

This specification includes part or all of the contents 
as disclosed in the specification and/or drawings of 
Japanese Patent Application No. 10-222866, which is a 
priority document of the present application and 
incorporated herein by reference in its entirety. 

The above and other objects, effects, features and 
advantages of the present invention will become more 
apparent from the following description of embodiments 
thereof taken in conjunction with the accompanying 
drawings . 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the effect of the Triton X- 100 
concentration on the His-BGPh Activity. The standard of 
100% was defined as the activity at 0.1% Triton X-100. 

Figure 2 shows thermostability of His-BGPh at 90°C. 
Triton X-100 at 0.1% was present in the reaction mixtures. 
The standard of 100% was defined as the activity without 
hea ting . 

Figure 3 shows optimum pH of the activity for His- 
BGPh. The OD405 indicates the amount of released p - Nph 
group in acetate buffer (square) and phosphate buffer 
(circle). The closed symbols correspond to the activity 
of BGPh and open symbols correspond to the activity of 
His-BGPh. For these measurements, equal amounts of BGPh 
and His-BGPh were used because the heated suspension I 
(BL21 (DE3) /pET-lla/BGPh or BL21(DE3)/pET-15b/BGPh) was 
estimated to contain the same amount of each induced protein 
by quantification using SDS-PAGE analysis. 

Figure 4 shows temperature dependency of BGPh. 
Optimum temperature was determined by the plots of 
enzymatic activity (OD405 nm change) against reaction 
temperature. An Arrhenius plot of the data is given in the 
inset. 

Figure 5 shows aligned amino acid sequences of five 
0 - glycosidases from hyper thermophi 1 ic archaea. The 
abbreviations of the \sources of the enzymes are : BGPh, 
0 - glycosidase from R. horikoshii; B M P h , a & - 
mannosidase gene homolig from P. horikoshii (8, 9) ; BG P f , 




jS - glucosidase fyrom P. furiosus (17) ; BMPf , 0 - mannosidase 
from P. furiosus! (17) ; S £ - gly, 0 - glycosidase 
f romSul f olobus solfataricus (18) . The conserved residues, 
identified automatically by the GeneWorks program, are 
shown in the open boxes. The reversed open triangles 
indicate the location of the nucleophile (E324) and the 
putative acid/base datalyst (E155 and Hill) with R75 in 
the spatial proximityy of the nucleophile of BGPh. The 
arrow shows the prominent deletion of more than 30 residues 
found in BGPh. 

Figure 6 shows illustrated location of the four 
hydrophilic edges on the tetragonal structure of S@ - gly 
(30) and the four hydrophobic areas exposed by removing 
the hydrophilic loops forming the edges. (A) The 
tetragonal arrengement with the hydrophilic edges (blue) . 
(B) The tetragonal arrengement with a hydrophobic surface 
(red) created by the deletion of the hydrophilic loops, 
shielding barrel helices 3 and 4 from solvent. 

Figure 7 shows a comparison of hydropacy profiles 
between BGPh and S /3 - gly. The panel (A) shows the 
hydropacy profile of BGPh and panel (B) shows the hydropacy 
profile of S 0 - gly. The arrows indicate the corresponding 
residue numbers. Two hydrophobic clusters are observed in 
BGPh but not in S £ - gly. 



DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention will be described specifically 
be 1 ow . 

The enzyme according to the present invention is a 
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thermophilic enzyme having 0 -glycosidase activity which 
comprises the amino acid sequence of SEQ ID NO: 2 in which 
one or a plurality of amino acid residues may be deleted, 
replaced or added. The enzyme comprising an amino acid 
sequence of SEQ ID NO: 2 and having 0 -glycosidase activity 
is derived from a su 1 f u r - me t ab o 1 i z ab 1 e thermophilic 
archaeon Pyrococcus horikoshii (the accession number: JCM 
9974). One example of the processes for producing the 
enzyme is described below. 

First, cells of Pyrococcus horikoshii are cultured and 
^ then chromosomal DNA was prepared therefrom. The 

chromosomal DNA is digested with restriction enzyme (s) to 
^ give fragments, and a genomic DNA library is constructed 

using the fragments. Clones which cover the chromosome of 
in Pyrococcus horikoshii are selected and aligned. The 

S3 aligned clones are sequenced and a gene encoding a 0 - 

£3 glycosidase is identified. The nucleotide sequence of the 

\8 gene encoding 0 -glycosidase is depicted in SEQ ID NO: 1. 

The gene is amplified by the PCR method and then extracted. 
The extracted gene is inserted into an expression plasmid 
suitable for protein production (e.g., pETlla or pET15b) . 
The resultant recombinant plasmid is introduced into cells 
of a host (e.g., Escherichia coli), from which the enzyme 
can be produced. The produced enzyme is isolated and 
purified by heating and then subjecting to column 
chroma tography. 

As a result, it is revealed that the purified enzyme 
is a protein having a molecular weight of about 45,000 Da 
and capable of hydrolyzing 0 -glycosides. When the enzyme 
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is treated in 50 mM phosphate buffer (pH 6.0) containing 
2 50 mM NaCl at 95° C for 1 hour, its activity is retained 
at the level of 80% based on the initial level. The enzyme 
has an optimum pH of pH 6.0 and an optimum temperature of 
100° C or higher in terms of the enzymatic activity- 
Variants of the enzyme, that is, thermophilic enzymes 
comprising deletion, replacement or addition of one or a 
plurality of amino acid residues in the amino acid sequence 
of SEQ ID NO: 2 and having # -glycosidase activity, may be 
prepared by any known techniques, such as s i t e - sp e c i f i c 
mutagenesis and the PCR method. 

The enzymes of the present invention can be used for 
hydrolysis of saccharides, DNA sequencing, conformational 
analysis of glycoproteins and glycolipids, synthesis of 
origosacchar ides and he terosacchar ides with high optical 
purities, and the like. 

DEPOSIT OF MICROORGANISM 

A transformant designated 11 E. coli BL21 (DE3) 
pETl 5b/Gly2M" which is E. coli BL21 (DE3) transformed with 
an expression vector containing a 0 -glycosidase gene 
(pETl 5b/Gly2M) was deposited under the terms of the 
Budapest Treaty on July 27, 1999 at the National Institute 
of Bioscience and Human - technology , Agency of Industrial 
Science and Technology, Japan (1-3, Higashi 1-chome, 
Tsukuba-shi, Ibaragi-ken, Japan) under Accession No. FERM 
BP - 6 8 0 0 . 

The following examples are given as more specific 
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illustration of the invention. It should be understood, 
however, that the invention is not limited to the specific 
details set forth in the examples. 

EXAMPLES 

M^br-pv iai-.inns : BGPh, 0 -glycosidase from P. horikoshii; 
BMPh , a 0 -mannosidase gene homolog from P. horikoshii; 
BGPf, 0 -glucosidase from P. furiosus; BMP f , 0 -mannosidase 
from P. furiosus; S 0 -gly, 0 -glycosidase f romSul f olobus 
sol f atar icus ; Amp, ampicillin; IPTG, i s opropy 1 - 0 - D - 
thiogalactopyranoside ; His-BGPh, BGPh with His-tag; 
SDS-PAGE, sodium dodecyl su 1 f a t e - po 1 y a c ry 1 ami de gel 
electrophoresis; CBB R , Coomassie Brilliant Blue R; X-Glu, 
5-bromo-4-chloro-3-indolyl- 0 -glucopyranoside; p-Nph- 0 
-D-Glcp, p - ni tr opheny 1 0 - D - glucopyranoside ; LA - 0 -D-Glcp, 
0 - D - g 1 ucopy r ano s i de s with long alkyl chains. 

MATKRTAT.fi AND METHODS 

Chemicals - The pET-lla vector and ul tracompe tent E. 
coli XL2-Blue MRF 1 cell were purchased from Stratagene. 
The pET-15b vector and E. coli strain BL21 ( DE 3 ) were 
obtained from Novagen. Vent DNA polymerase was purchased 
from New England Biolab's. Restriction enzymes were 
purchased from Promega and Toyobo (Osaka, Japan) , and were 
used according to the manufacturers' recommendations. 
Ultrapure deoxynu c 1 e o t i de solution (dNTPs) was obtained 
from Pharmacia Biotech. Isopropyl-/3 - D- 

thiogalac topyranoside (IPTG) was from Takara Shuzo (Otsu, 
Shiga , J apan) . 



Cloning of Genes and Cons truct ion of Expression Vector 
The genome of P. horikoshii was sequenced using the 
method of Kaneko et al. (10) . Standard cloning techniques 
were used throughout. The expression vectors pET-lla and 
pET-15b were doub 1 e - d i g e s t e d by the restriction enzymes 
Nde I and BamH I and the resulting 5.7Kbp fragment was 
purified with a QIAquick Gel Extraction Kit (QIAGEN) . The 
gene coding P - glycosidase (BGPh) was amplified by the PCR 
method using the following two primers: upper primer, 
T AAGAAGGAGAT ATAC AT ATGC CGCTG AAATTCC C GGAAATGTTTCTCTTTGGT 
ACC (SEQ ID NO: 3); lower primer, 

TTT ACTGC AG AG AGG ATC C CTAAT C CT AAAGTTG AAGTT CTGGTAG (SEQ ID 
NO: 4) . The PCR product was cloned into expression vectors 
pET-lla and pET- 15b using Ndel and BamHI sites. 

The digested 1.3 Kbp fragment coding BGPh was purified 
and ligated to the insertion sites of the pET-lla and 
pET-15b vectors. Ul tracompe tent E. coli XL2-Blue MRF 1 
cells were transformed with the recombinant molecule. 
Transf ormants were screened on 2 xYT plates containing 50 
mg/ml of ampicillin (Amp) incubated at 37°C overnight. 
The transformant colonies were propagated in 5 ml 2 x YT 
+ Amp medium at 3 7 °C overnight and the vectors pET-lla/BGPh 
and pET-15b/BGPh were purified after c en t r i f uga t i on using 
a Mini Plasmid Kit (QIAGEN). The pET-lla/BGPh and pET- 
15b/BGPh were doub 1 e - d i g e s t e d with Ndel and BamHI and the 
insert length was checked using agarose gel 
electrophoresis. The absence of additional mutations 
within the coding region of BGPh was verified by sequencing 
on an Applied Biosystems 373A DNA sequencer (Taq DyeDeoxy 
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Terminator Cycle Sequencing Kit, PerkinElmer ) . 

Overexpression and Purification of Recombinant Protein 
The E. coli strain BL21 (Dfi3) was transformed with the 
pET-lla/BGPh plasmid to express mature BGPh and pET- 
15b/BGPh plasmid to express His-tagged BGPh. The 
transformant colony was propagated as seed culture in 200 
ml 2 x YT +Amp medium at 3 7 °C overnight. An inoculate of 
40 ml seed culture was inoculated to 2 1 of 2 x YT +Amp 
medium. The transformant was induced at OD600 = 1 with 1 mM 
IPTG for 4 h. The induced cells were collected by 
centr i f ugat ion and stored at -20 C 

The frozen cells (7 g) were thawed and mixed with 10 
ml of 50 mM Tris-HCl buffer (pH 7.5) and 5.6 ml of 10% Triton 
X-100, resulting in a final concentration of 2.5%. The cell 
suspension was heated at 85°C >for 10 min, then centrifuged 
at 5000 x g for 20 min. The supernatant was collected and 
stored at 4°C . The cell pellet was mixed with the same 
volume of the buffer and Triton X-100 and heated again. 
The heated sample was centrifuged at 25000 x g for 20 min. 
The combined supernatant was mixed with 1 mg of bovine DNase 
1 (Sigma) and incubated at 37°C for 30 min. The supernatant 
was heated at 85°C for 10 min, then centrifuged at 25000 
x g for 20 min to remove the inactivated DNase. 

The solubilized recombinant BGPh with His-tag 
(His -BGPh) was subjected to affinity chromatography with 
Ni - con j uga ted Sepharose, using a stepwise elution from 5 
mM to 1 M imidazole in 20 mM Tris-HCl (pH 8.0) with 0.5 
M NaCl solution (His-bind Buffer Kit, Novagen) containing 
0.1% Triton X-100. BGPh was eluted with 100 mM imidazole 
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with 0.1% Triton X-100. The enzyme samples were analyzed 

by sodium dodecyl s u 1 f a t e - po 1 y ac ry 1 ami de gel 
electrophoresis (SDS-PAGE) (11); a low molecular weight 
electrophoresis calibration kit, purchased from Pharmacia 
Biotech, was also run. For SDS-PAGE (PhastGel, 10-15%), 
the enzyme sample (5 ml) was mixed with SDS sample buffer 
(5 ml) , boiled for 5 min, mixed with marker dye (1 ml) and 
applied to the gel in 1 or 4 ml aliguots. Following 
electrophoresis, protein was detected by Coomassie 
Brilliant Blue R (CBBR) staining according to the 
manufacturer's recommendation. The His-tagged protein 
was detected with QIAexpress Detection System (QIAGEN) 
after blotting onto a nitrocellulose membrane (Pharmacia 
Biotech) . 

Cellular Localization of the Activity - Localization 
of the BGPh activity in E. coli transformant cells 
(BL21 (DE3) /pET-lla/BGPh or BL2 1 (DE3 ) /pET - 15b/BGPh) was 
examined by fractionation of the cell components. The cell 
membrane was isolated as follows: 7 g of the induced cells, 
which were frozen at -20°C , were thawed and mixed with 10 
ml of 50 mM Tris-HCl buffer (pH 7.5) . The cell suspension 
(suspension I) was sonicated with a Sonifier 250 (Branson) 
for 4 min at an output control level of 4 and at 30% duty 
cycle. The sonicated sample was centrifuged at 9,000 x g 
for 10 min to remove cell debris, then the supernatant (12 
ml) was ul tracen tr i f uged at 100,00 0 x g for 1 h to separate 
the membrane fraction (1 ml) from the supernatant. The 
enzyme reactions were carried out at 90 C for 15 min in 
a solution (200 ml) containing 1.2 mM 5-bromo-4- 



chloro - 3 - indoly 1 - & • glucopy ranos ide (X-Glu) and 5 ml of 
each fraction, as the enzyme source, in 50 mM phosphate 
buffer (pH 6) with 0.3 M NaCl. After the reaction, the 
solution was cooled in ice and diluted with 1 ml of water; 
the absorbance at 620 nm was immediately measured. As a 
control, the assay reactions were performed under the same 
conditions but without X-Glu to subtract the turbidity 
derived from each fractionated sample. 

To analyze the solubilizing effect of Triton X-100, 
suspension I was also heated with and without 2.5% Triton 
X-100 at 85°C for 10 min and the supernatant was obtained 
by centr if ugation at 15,000 x g for 10 min. The activity 
of the supernatants was measured using X-Glu as shown above. 

Dependence of the BGPh Activity on Triton X-100 - The 
enzyme reactions were carried out at 98 C for 20 min in 
a solution ( 2 0 0 ml) containing 3 mM p-Nph-/3 - D-Glcp (a 
p - ni tropheny 1 saccharide) and 57.5 pM of the purified 
His - BGPh in 5 0 mM phosphate buff er (pH 6) wi th Tr i ton X - 1 0 0 
and 0.1 M NaCl. The concentration of Triton X-100 in the 
reaction solution was varied from 0.1% to 0.00002%. The 
reaction was terminated by the addition of 1 M Na2C03 (1 
ml), then centrifuged at 15,000 x g for 10 min. The 
concentration of the p-Nph group in the supernatant was 
quantified by measuring the absorbance at 400 nm . 

Measurement of the Kinetic Parameters - The enzyme 

reactions were carried out at 90 °C in a solution (20 0 ml) 
containing the substrate and the purified His- BGPh in 50 
mM phosphate buffer (pH 6) with 0.1% Triton X-100 and 0.3 
M NaCl. For the hydrolysis of p - ni tropheny 1 (p-Nph) # 
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- D - saccharides , the reaction was terminated by the 
addition of 1 M Na2C03 (1 ml), then centrifuged at 15,000 
x g for 10 min. The concentration of the p-Nph group in 
the supernatant was quantified by measuring the absorbance 
at 400 nm . For the hydrolysis of 0 - D-glucoside, the 
released glucose was analyzed with a Glucose C-II Test kit 
(Wako Pure Chemicals, Japan). Initial velocities were 
obtained directly from the initial slopes of the time course 
plots. The Km and kcat values were calculated using the 
Mi chae 1 i s - Men t en equation and the least squares method 
(12). The subsite affinity for a long alkyl chain was 
determined using the method reported previously (13-15) 
on the basis of the subsite theory (16) . 

Optimum Temperature and Optimum pH - The optimum 

temperature was measured as follows: the assay mixture (200 
ml), which contained 3 mM p - ni t r opheny 1 £ - D- 
glucopy r ano s ide (p-Nph- 0 - D-Glcp) in 150 mM citrate buffer 
(pH 5.0) and 1 ml of suspension I (BL21 (DE3) /pET-lla/BGPh) , 
was heated at 85°C for 10 min. The enzyme reactions were 
carried out in duplicate at temperatures ranging from 50 C 
to 100 °C for 30 min. Optical density measurements at 
A405 were performed as described for the enzyme assays. 

To determine the optimum pH, the assay mixture (200 
ml), which contained 1 ml of heated suspension I 
(BL21(DE3)/pET-lla/BGPh or BL21(DE3)/pET-15b/BGPh) and 
p-Nph - 0 - D-Glcp (3 mM) in 139 mM buffer systems, was heated 
at 90 °C for 30 min. The pH of the reaction mixtures ranged 
from 3.9 to 5.5 in sodium acetate buffer and from 5.5 to 
7.99 in phosphate buffer. Optical density measurements 
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at A405 were performed as described for the enzyme assays. 

Thermostability - The His-BGPh solutions (29 nM) in 50 
mM phosphate buffer (pH 6.0) containing 100 mM NaCl and 
0.1% Triton X-100 were heated in sealed Eppendorf tubes 
at 90°C in various increments up to 24 h. The heated 
enzymes were assayed in duplicate in phosphate buffer (pH 
6.0) at 90°C for 20 min as described for the determination 
of optimum temperature. 

Sequence Alignment, Phylogenetic Tree, and Hydropacy 
Profile - Sequence alignment of 0 - glycosidases was 
performed using the GeneWorks program ( I n t e 1 1 i Gen e t i c s , 
Inc.) based on a PAM-250 scoring matrix. The enzymes of 
interest were: 0 - glycosidase (BGPh) studied in this paper 
and 0 - mannosidase (BMPh) from P. horikoshii (8, 9) , 0 
- glucosidase (BGPf) and 0 - mannosidase ( BMP f ) from P. 
furiosus (17) , and 0 - glycosidase (S 0 - gly) 
f romSul f olobus solfataricus (18). Phylogenetic trees 
for the same sequences were constructed using the GeneWorks 
program based on the unweighted pair group method with an 
arithmetic mean (19) . Each hydropacy profile was analyzed 
with DNASIS-Mac v2 . 0 software based on the Kyte and 
Doolittle method (20). 



PF!.qTTT,TS AMD D T fi C. IT S S T O N 

Local i zat ion of the Activity in E. coli Membrane - The 
intracellular localization of His-BGPhwas examined (Table 
I) . 
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Table I. Cellular localization of the activity. The Iransformant E. coli BL2i(DE3)/pET!5b/BGPh 
cells were used for this experiment. The enzyme reaction were performed at 90°C and pH 6 for 15 
min using X-Glu as substrate, and then A f>20 was measured as shown in "MATERIALS AND 
METHODS". 



Activity after each treatment (A G20 ) 



Cell fractions 



Sonication Non-heated Heated Non-heated with Heated with 

2.5% Triton X-100 2.5% Triton X-100 



Suspension I 0.585 - 0.585 0.567 0.485 0.428 

Supernatant at 9,000 xg 0.112 ND ND ND ND 

Supernatant at 15,000 xg ND 0.008 0.005 0.107 0.255 

Supernatant at 100,000 xg 0.010 ND ND ND ND 

Fraction precipitated at 0.478 ND ND ND ND 

100,000 xg 

ND; not determined. 

The induced cells were disrupted by sonication and 
centrifuged to separate the cell components. The membrane 
fraction was precipitated by ul trac en t r i f uga t ion at 
100,000 x g from the supernatant recovered by 
cen tr i f uga t ion at 9., 0-0 0 x g. The activity was present in 
the membrane fraction whereas no activity was detected in 
the soluble fraction after the ul tracentr i f uga t ion . 
His-BGPhwas solubilized from the cell suspension 
(suspension I) by heating with a detergent, Triton X-100; 
the enzyme was not solubilized by heating without Triton 
X-100. The solubilizing efficiency with Triton X-100 was 
elevated by heating up to 85 °C , whereas only 22% of the 
activity was extracted at room temperature. The best 
condition for the solubilization was 2.5% Triton X-100 at 
8 5 °C for 15 min. The native -type BGPhwas also solubilized 
under the same condition as His-BGPh (data not shown) ; 
however, the denaturation with 8 M urea and the renaturation 
by direct dilution with buffer had no effect on the 
solubilization of the activity (data not shown) . These 
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facts strongly indicate that BGPh is a thermostable 
membrane protein solubilized by Triton X-100. 

His - BGPh was purified by one-step affinity 
chromatography using Ni - con j uga ted Sepharose. Since the 
recovery of the active enzyme was decreased to a few percent 
by the elimination of Triton X-100 from the chromatographic 
washing and elution buffers, the presence of Triton X- 
100 in the buffer system was essential for the stabilization 
of BGPh. 

As shown in Figure 1, the activity of BGPh was dependent 
?1 on the concentration of Triton X-100. At 0.00002% Triton 

V t% 

u\ X-100, the activity decreased to 10% of that with 0.1% 

Triton X-100. Furthermore, BGPh was stabilized in the 
sTl presence of 0.1% Triton X-100: the half-life of the activity 

%n was 15 h at 90°C and pH 6.0 (Fig. 2). These facts also 

2 

y suggest that BGPh is the membrane protein. 

O The Substrate Specificity of BGPh - For BGPh both with 

y3 or without His-tag, the optimum pH was 6.0 (Fig. 3) and 

the optimum temperature was over 100 °C (Fig. 4) . The 
substrate specificity of His -BGPh was examined using 
p-Nph- P - D - sacchar i de s and & - D-glucosides as substrates. 
The specificity is summarized in Table II in comparison 
with that of S/3-gly (7, 21). 
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Table EL Comparison of the kinetic parameters between his-tagged BGPh from P. horikoslui and 
S(S-gly from S. solfataricus strain MT-4 against p-Nph-p%D-saccharides and (3-D-glucosides: 

Hh-BGPh (90X and pH 6.0) Sp-gly* (75°C and pH 6.5) 

Substrates (sec 1 ) (mM) (raW'sec' 1 ) (sec -1 ) (mM) (mM'sec' 1 ) 



Laminaribiose 


184 


138.23 


Cellobiose 


194 


1698.18 


Cellotriose 


ND" 


ND 


<w c uo t c crao s c 


NT) 


NT) 


R-Genriobiose 


ND 


ND 


n-Nnh-8-D-Glcz? 


79 


0.35 


n-Nnh-R-n-Oaln 


123 


1.30 


n-Nnh-R-D-Xvln 


3 


0.10 


/7-Nph-(3-D-Manp 


2 


0 14 


Salicin 


44 ' 


1.96 


Methyl- p-D-Glcp (Alkyl : 


C.) 35 


40.74 


n-Amyl-p-D-Glcp (Alkyl : 


:Q) 31 


2.02 


n-Hexyl-P-r>Glcp (Alkyl 


:Q) 33 


0.54 


n-Octyl-P-D-Glcp (Alkyl : 


Q) 34 


0.20 


n-Nonyl-p-D-Glcp (Alkyl 


: C 9 ) 39 


0.08 


n-Decyl-|3-D-Glcp (Alkyl 


:C 10 ) 37 


0.08 


n-Undecyl-(3-D-Glcp (Alkyl:C H ) 43 


0.05 


n-Dodecyl-P-D-Glcp (Alkyl:C, 2 ) 36 


0.03 



1.33 


908 


1.0 


908.0 


0.11 


1333 


30.0 


44.4 


ND 


197 


3.0 


66 




jo4 


1 7 




ND 


1360 


100 


14 


225.67 


542 


0.5 


1084.0 


94.34 


1020 


4.7 


217.0 


31.83 


284 


4.0 


71.0 


14.60 


NH= 


NH 


NH 


22.20 


880 


5.0 


175.9 


0.85 








15.11 


256 


1.1 


232 


60.28 


263 


1.0 


263 


170.70 


313 


0.7 


434 



471.57 
469.62 
944.37 
1152.90 



1 Citated from references (7, 21). 

b ND; The parameters were not determined because of too high K m values. 

c NH; The substrate was not hydrolyzed by S(S-gly. 
a -; The parameters were not reported in the references. 
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His -BGPh hydrolyzed aryl glycosides efficiently, 
showing kcat/Km values decreasing in the order p-Nph-/3 - 
D-Glcp > p-Nph-jS- D-Galp > p-Nph-0- D-Xylp > p-Nph-/3- 
D-Manp. Beta-linked glucose dimers tested were poorly 
hydrolyzed; the order of preference was #1-3 > #1-4 > 0 
1-6. The kcat values of BGPh without His - tag for these 0 
- linked glucose dimers approached 400 sec* 1 , which is 
comparable with those of S 0 - gly (Table II). His -BGPh 
probably had approximately 50% of the activity of BGPh due 
to interference by the His-tag located at the N- terminus, 
(Fig. 3) . Surprisingly, the best substrates for His - BGPh 
were 0 - D - g 1 u c o s i de s with long alkyl chains (LA- 0 - D-Glcp) . 
The Km values decreased according to the elongation of the 
alkyl chain from C 1 to C 12 , although the kcat value was 
constant (approximately 35 sec 1 ) for each alkyl - 0 - D-Glcp. 
The kcat values of native type BGPh for LA-£ - D-Glcp 
approached 70 sec" 1 , calculated on the basis of the value 
of His -BGPh, estimating a 50% decrease in the activity from 
the inhibitory effect of the His-tag. The value was also 
appreciable, around 30% of that of S & - gly (Table II) . The 
Km value of His -BGPh for the hydrolysis of n-Dodecyl-0- 
D-Glcp (alkyl chain : C 12 ) was extremely low, 30 mM at 90 C 
and pH 6.0. Of the substrates examined thus far, the best 
substrate was n-Dodecyl-/3 - D-Glcp as shown in Table II. 
The kcat/Km value of His -BGPh against n-Dodecyl- 0 - D-Glcp 
was 5 times higher than that of p-Nph-/3 - D-Glcp and 870 
times higher than that of laminar ibiose . Even the value 
for n-Octyl-/3 - D-Glcp was 0.76 times higher than that of 
p-Nph-/3 - D-Glcp and 128 times higher than that of 
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laminar ibiose . The kcat/Km value of S# - gly against 
n-Octyl-# - D-Glcp, with the longest alkyl chain so far 
examined (21), was 0.4-fold higher than that for p-Nph- 
/? - D-Glcp and 0.48-fold higher than that for laminaribiose. 
Laminar ibiose and cellobiose were not good substrates for 
the hydrolysis of His-BGPh because of their Km values higher 
than 100 mM . His-BGPh also hydrolyzed cellotriose and 
cellote traose with low efficiency: the kinetic parameters 
were not determined because of the extremely high Km value, 
whereas S /3 - gly was able to hydrolyze these 
oligosaccharides with high efficiency: the kcat/Km values 
descended in the order; c e 1 1 o t e t rao s e > cellotriose > 
cellobiose. Thus, the substrate specificity of His-BGPh 
is different from those of the other 0 - g 1 y c o s i da s e s , 
including S £ - gly (7, 17, 21-23). BGPh has a novel 
substrate specificity with high efficiency to hydrolyze 
LA- 0 - D-Glcp and low efficiency to hydrolyze any £ - linked 
glucose dimer which is more hydrophilic than aryl- or alkyl - 
0 - D-Glcp. The subsite affinity (A (C11) ) to bind a long 
alkyl chain (cll) was calculated according to the following 
equation; A (CU) =RT( (kcat/Km) for n ; Dodecyl j . d -gic P / (kcat/Km) for 
Methyl-^ - o-Gicp) - The affinity was determined to be 4.26 
kcal/mol. The value was reasonable when compared with the 
highest affinity (4.23 kcal/mol) known, that of the 
recognition of one glucose unit iii the subsite structure 
of Sac charomycop s i s amylase (13, 14) . These facts 
indicate that the hydr ophob i c i t y of the aglycon part of 
the substrates is strongly recognized by the BGPh molecule 
and the hydrophobic substrates, including aryl- and LA- 
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0 - D-Glcp, are hydrolyzed effectively with low Km values 
due to hydrophobic interaction between the aglycon moiety 
and the BGPh molecule. Thus, BGPh might be useful to 
synthesize novel P - glycosides, including new 
biosur f ac tant s , using its t ransglycosy la tion activity 
because of its stability in organic solvents (data not 
shown) . 

Henrissat proposed an alternate and complementary 
classification scheme for glycosyl hydrolases based on 
amino acid sequence similarities (24-26). For example, 
glycosyl hydrolase family 1 is composed of exo-acting, £ 
- specific enzymes with similar amino acid sequences. The 
five # - glycosidases , including BGPh from the archaea 
domain (as shown in Fig. 5), belong to family 1. Some 
family 1 glycosyl hydrolases also have glycosyl 
transferase activities. The S. solfataricus £ - 
glucosidase has been implicated in the glycosy la t ion of 
membrane lipid components (27) . Similarly, the enzymatic 
analysis of BMP f predicted its possible role in the 
synthesize of intracellular components including protein, 
membrane components, or other compounds (17). Since the 
localization of BGPh on E. coli membrane strongly indicates 
the intimate interaction of the enzyme and lipid components, 
the detection of BGPh on the Pyrococcus cell surface using 
antibody against the enzyme must be done to clarify its 
true function in the Pyrococcus cell. 

The Structural Element s Responsible for Membrane 
Localization and the Conservation of Residues Forming the 
Active Site - The sequence alignment among BGPh and four 
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different 0 - glycosidases, whose biochemical 
characteristics have been reported (7, 17, 21-23) , is shown 
in Figure 5. According to the phylogenetic analysis based 
on the alignment, the tree has three branches : one 
corresponding to a )3 - glycosidase group that includes BGPf 
and S £ - gly; another containing BMPh and BMP f , which were 
close to $ - mannosidase. BGPh belongs to the thirdbranch, 
located some distance from the first two branches. The 
polypeptide length of BGPh is also approximately 13% 
shorter than those of the other four £ - glycosidases and 
might be one of the shortest sequences so far reported (8, 
17, 18, 28) . As shown in Figure 5, the residues E155 and 
Hill of BGPh correspond to E20 6 and H150 as the putative 
acid/base catalyst in the S )S - gly molecule (28, 29 ) , whose 
steric structure has been reported (30). The residues 
E324 and R75 of BGPh correspond to E3 87, the nucleophile, 
and R79 in the spatial proximity of the nucleophile (28, 
29) . The complex structure of Bacillus polymixa 0 - 
glycosidase with the inhibitor gluconate has been reported 
(31). The BGPh residues, Q19, Hill, N154, E155, Y267, 
E324, W3 62 , E369, and W370 are completely conserved (Fig. 
5) and correspond to the B. polymixa glycosidase 
residues, Q20, H121, N165, E166, Y296, E352, W398, E405, 
and W406, which form the intimate interaction with the 
inhibi tor (31). 

To understand the localization mechanism of BGPh to 
the membrane, a major structural difference between BGPh 
and the other soluble & - glycosidases was analyzed using 
the sequence alignment and the steric structure of S& - 
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gly (30) . The S]3 - glymolecule has the classic ( & a) Q barrel 
fold first seen in the structure of triose phosphate 
isomerase (32) . For BGPh, the prominent deletion of more 
than 30 residues was found after the 78th residue, as 
indicated in Figure 5. The deletion region of BGPh 
corresponds to loops from the 89th to 125th residues of 
S0 - gly* mainly shielding the helices 3 and 4 from solvent. 
The hydrophilic loops, which pack against the outer face 
of the barrel helices 3 and 4, were not present in the BGPh 
molecule. A tetrameric S jS - gly structure has been 
reported, in which these loop regions were located at the 
four edges of regular tetragonal molecular arrangement 
(30) . Figure 6 illustrates the location of the four 
hydrophilic edges and four hydrophobic areas which appear 
following the removal of the hydrophilic loops. Since BGPh 
as well as S 0 - gly was proved to be tetramer by gel 
filtration using buffer containing 0.01% Triton X-100 
(data not shown), the deletion of these hydrophilic loops 
probably results in the exposure of helices 3 and 4 to the 
solvent at the four edges of the tetrameric structure. The 
exposed hydrophobic areas might interact with lipid 
components to embed the molecule in the membrane. 

The increased hy d r ophob i c i ty at barrel helices 3 and 
4 is also indicated by the compari son of the hydropacy plots 
of BGPh and S /3 - gly, as shown in Figure 7. Two major 
hydrophobic clusters were observed in the region of BGPh 
between residues 79 and 210 corresponding to the region 
of S 0 - gly between residues 90 and 265. These residues 
form tertiary structures from the end of # - sheet 2 to the 
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beginning of jS- sheet 5 of the ( 0 a ) 8 barrel fold (30). 
The first cluster was located between residues 79 to 114, 
forming a helix with a loop shortened by the deletion, 
a-helix 2, and 0 - sheet 3. The second cluster was present 
between residues 131 and 210, corresponding the barrel 
fold between a-helices 3 and 4 exposed to solvent. A 
hydrophilic module that might be important for enzyme 
orientation on the membrane was found between residues 114 
to 131, corresponding to the hydrophilic helices at the 
molecule surface located between the 0 - strand and the 
a-helix in the third repeat of the barrel fold. The two 
hydrophobic clusters, but not the hydrophilic module, were 
lacking in the corresponding region of S0 - gly (18) . 

A mechanism for the localization of BGPh is proposed 
here based on the possible hydrophobic interaction between 
the membrane and the exposed hydrophobic helices 3 and 4 
at the four edges of the tetrameric structure exposed by 
the deletion of the hydrophilic loops. Furthermore, the 
mechanism is well supported by the hydropacy profile of 
BGPh, in which the hydrophobic cluster is formed by the 
barrel fold between a-helices 3 and 4. The exposed 
hydrophobic areas may lead the hydrophobic substrates to 
the active site and bind them there. However, further 
studies using the cry s tal lographic analysis are needed for 
a more definitive description of the detailed mechanism 
for recognition of the hydrophobic aglycon part, including 
a long alkyl-chain. 

As described above, the present invention provides a 
novel 0 -glycosidase. The 0 -glycosidase is stable under 
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extreme conditions. Therefore, the /3 -glycosidase can be 
used to develop he terosacchar ides with high optical 
pur i ties. 
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cited herein are incorporated herein by reference in their 
entity. 

The invention has been described in detail with 
reference to various embodiments, and it will now be 
apparent from the foregoing to those skilled in the art 
that changes and modifications may be made without 
departing from the invention in its broader aspects, and 
it is the invention, therefore, in the appended claims to 
cover all such changes and modifications as fall within 
the true spirit of the invention. 

The following is information on sequences described 
here in : 

SEQUENCE LISTING 

<110> Director - General of Agency of Industrial Science and Technology 
< 120 > Heat-resistant enzyme having 0 - glycosidase activity 
<130> PH-679US 



<150> JP 10-222866 

<141> 1998-08-06 

< 1 6 0 > 4 

<170> Patentln Ver. 2.0 

< 2 1 0 > 1 
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<211> 1269 

< 2 1 2 > DNA 

< 2 13 > Pyrococcus horikoshii 



< 2 2 0 > 

< 2 2 1 > CDS 

<222> (1) . . (1269) 



< 4 0 0 > 1 

atg ccg ctg aaa ttc ccg gaa atg ttt etc ttt ggt acc gca aca tea 48 

Met Pro Leu Lys Phe Pro Glu Met Phe Leu Phe Gly Thr Ala Thr Ser 

1 5 10 15 

tec cat cag ata gag gga aat aat aga tgg aat gat tgg tgg tac tat 96 

Ser His Gin lie Glu Gly Asa Asn Arg Trp Asn Asp Trp Trp Tyr Tyr 

2 0 .25 30 

gag cag att gga aag etc ccc tac aga tct ggt aag get tgc aat cac 144 

Glu Gin lie Gly Lys Leu Pro Tyr Arg Ser Gly Lys Ala Cys Asn His 

3 5 4 0 4 5 

tgg gaa ctt tac agg gat gat att cag eta atg acc age ttg ggc tat 

Trp Glu Leu Tyr Arg Asp Asp lie Gin Leu Met Thr Ser Leu Gly Tyr 

50 55 60 

aat get tat agg ttc tec ata gag tgg age agg eta ttc cca gag gaa 

Asn Ala Tyr Arg Phe Ser lie Glu Trp Ser Arg Leu Phe Pro Glu Glu 

65 70 75 80 

aat aaa ttt aat gaa gat get ttc atg aaa tac egg gag att ata gac 288 

Asn Lys Phe Asn Glu Asp Ala Phe Met Lys Tyr Arg Glu lie lie Asp 

8 5 9 0 9 5 

ttg tta ttg acg aga ggt ata act ccc ctg gtg acc eta cac cac ttt 

Leu Leu Leu Thr Arg Gly lie Thr Pro Leu Val Thr Leu His His Phe 



192 



240 



3 3 6 
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4 3 2 



4 80 



5 2 8 



100 105 110 

act age cct etc tgg ttc atg aag aaa ggt ggc ttc ctt agg gag gag 384 
Thr Ser Pro Leu Trp Phe Met Lys Lys Gly Gly Phe Leu Arg Glu Glu 

115 120 125 

aac eta aaa cat tgg gaa aag tac ata gaa aag gtt get gag ctt tta 
Asn Leu Lys His Trp Glu Lys Tyr lie Glu Lys Val Ala Glu Leu Leu 

130 135 140 

gaa aaa gtt aaa eta gta get ace ttc aat gag ccg atg gta tac gta 
Glu Lys Val Lys Leu Val Ala Thr Phe Asn Glu. Pro Met Val Tyr Val 
145 150 155 160 

atg atg gga tat eta acg get tat tgg ccc cca ttc att agg agt cca 
Met Met Gly Tyr Leu Thr Ala Tyr Trp Pro Pro Phe lie Arg Ser Pro 

165 170 175 

ttt aag gec ttt aag gta get gca aac ctg ctt aaa get cac gca att 576 
Phe Lys Ala Phe Lys Val Ala Ala Asn Leu Leu Lys Ala His Ala lie 

180 185 190 

gee tat gaa ctt ctt cat ggg aaa ttc aaa gtt gga ate gta aag aat 624 
Ala Tyr Glu Leu Leu His Gly Lys Phe Lys Val Gly lie Val Lys Asn 

195 2 00 2 05 

att ccc ata ata. etc cca gcg agt gac aag gag agg gat aga aaa gee 672 
lie Pro lie lie Leu Pro Ala Ser Asp Lys Glu Arg Asp Arg Lys Ala 

210 215 220 

get gag aaa get gat aat tta ttt aac tgg cac ttt ttg gat gcg ata 720 
Ala Glu Lys Ala Asp Asn Leu Phe Asn Trp His Phe Leu Asp Ala lie 
225 230 235 240 

tgg agt ggg aaa tac aga ggg gta ttt aaa aca tat agg att ccc caa 768 
Trp Ser Gly Lys Tyr Arg Gly Val Phe Lys Thr Tyr Arg lie Pro Gin 

24 5 250 255 

agt gac gca gat ttc att ggg gtt aac tat tac acg gee age gaa gta 816 
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Ser Asp Ala Asp Phe lie Gly Val Asn Tyr Tyr Thr Ala Ser Glu Val 

260 265 270 

agg cat act tgg aat cct tta aaa ttc ttc ttt gag gtg aaa tta gcg 864 

Arg His Thr Trp Asn Pro Leu Lys Phe Phe Phe Glu Val Lys Leu Ala 

275 280 285 

gat att age gag agg aag act caa atg gga tgg age gtt tat cca aaa 912 

Asp lie Ser Glu Arg Lys Thr Gin Met Gly Trp Ser Val Tyr Pro Lys 

290 295 300 

gga ata tac atg gec ctt aaa aaa get tec agg tat gga agg cct ctt 960 

Gly lie Tyr Met Ala Leu Lys Lys Ala Ser Arg Tyr Gly Arg Pro Leu 
305 310 315 320 

tat att acg gaa aac gga ata gcg acg ctt gat gat gaa tgg aga gtg 1008 

Tyr lie Thr Glu Asn Gly lie Ala Thr Leu Asp Asp Glu Trp Arg Val 

325 330 335 

gaa ttc ata att caa cac etc caa tac gtt cat aag get ate gaa gac 1056 

Glu Phe lie lie Gin His Leu Gin Tyr Val His Lys Ala lie Glu Asp 

340 345 350 

ggc ctg gat gta aga ggt tac ttc tat tgg tea ttt atg gat aac tac 1104 

Gly Leu Asp Val arg Gly Tyr Phe Tyr Trp Ser Phe Met Asp Asn Tyr 

355 360 365 

gag tgg aaa gag ggg ttt ggg cct aga ttt ggc eta gtg gaa gtt gat 1152 

Glu Trp Lys Glu Gly Phe Gly Pro Arg Phe Gly Leu Val Glu Val Asp 

370 375 380 

tat caa ace ttc gag aga agg ccc agg aag agt get tac gta tac gga 1200 

Tyr Gin Thr Phe Glu Arg Arg Pro Arg Lys Ser Ala Tyr Val Tyr Gly 
385 390 395 400 

gaa att gca aga agt aag gaa ata aag gat gag eta tta aag aga tat 1248 

Glu lie Ala Arg Ser Lys Glu lie Lys Asp Glu Leu Leu Lys Arg Tyr 

405 410 415 
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ggc eta cca gaa ctt caa ctt 
Gly Leu Pro Glu Leu Gin Leu 
420 



1269 



< 2 1 0 > 2 
<211> 423 

< 2 1 2 > PRT 

<213> Pyrococcus horikoshii 

< 4 0 0 > 2 

Met Pro Leu Lys Phe Pro Glu Met Phe Leu Phe Gly Thr Ala Thr Ser 

15 10 15 

Ser His Gin lie Glu Gly Asn Asn Arg Trp Asn Asp Trp Trp Tyr Tyr 

2 0 .25 30 

Glu Gin lie Gly Lys Leu Pro Tyr Arg Ser Gly Lys Ala Cys Asn His 

3 5 4 0 4 5 

Trp Glu Leu Tyr Arg Asp Asp lie Gin Leu Met Thr Ser Leu Gly Tyr 

5 0 5 5 6 0 

Asn Ala Tyr Arg Phe Ser He Glu Trp Ser Arg Leu Phe Pro Glu Glu 

65 70 75 80 

Asn Lys Phe Asn Glu Asp Ala Phe Met Lys Tyr Arg Glu lie lie Asp 

8 5 9 0 9 5 

Leu Leu Leu Thr Arg Gly lie Thr Pro Leu Val Thr Leu His His Phe 

100 105 110 

Thr Ser Pro Leu Trp Phe Met Lys Lys Gly Gly Phe Leu Arg Glu Glu 

115 120 125 

Asn Leu Lys His Trp Glu Lys Tyr He Glu Lys Val Ala Glu Leu Leu 
130 135 140 
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Glu Lys Val Lys Leu Val Ala Thr Phe Asn Glu Pro Met Val Tyr Val 

14 5 1 5 0 155 1 60 

Met Met Gly Tyr Leu Thr Ala Tyr Trp Pro Pro Phe lie Arg Ser Pro 

165 170 175 

Phe Lys Ala Phe Lys Val Ala Ala Asn Leu Leu Lys Ala His Ala lie 

180 -185 190 

Ala Tyr Glu Leu Leu His Gly Lys Phe Lys Val Gly lie Val Lys Asn 

195 200 205 

lie Pro lie lie Leu Pro Ala Ser Asp Lys Glu Arg Asp Arg Lys Ala 

210 215 220 

Ala Glu Lys Ala Asp Asn Leu Phe Asn Trp His Phe Leu Asp Ala lie 
22 5 230 235 240 

Trp Ser Gly Lys Tyr Arg Gly Val Phe Lys Thr Tyr Arg lie Pro Gin 

245 250 255 

Ser Asp Ala Asp Phe lie Gly Val Asn Tyr Tyr Thr Ala Ser Glu Val 

260 265 270 

Arg His Thr Trp Asn Pro Leu Lys Phe Phe Phe Glu Val Lys Leu Ala 

275 280 285 

Asp lie Ser Glu Arg Lys Thr Gin Met Gly Trp Ser Val Tyr Pro Lys 

290 295 300 

Gly lie Tyr Met Ala Leu Lys Lys Ala Ser Arg Tyr Gly Arg Pro Leu 
305 310 315 320 

Tyr lie Thr Glu Asn Gly lie Ala Thr Leu Asp Asp Glu Trp Arg Val 

325 33 0 335 

Glu Phe He He Gin His Leu Gin Tyr Val His Lys Ala lie Glu Asp 

340 345 350 

Gly Leu Asp Val Arg Gly Tyr Phe Tyr Trp Ser Phe Met Asp Asn Tyr 

355 360 365 

Glu Trp Lys Glu Gly Phe Gly Pro Arg Phe Gly Leu Val Glu Val Asp 
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370 375 380 

Tyr Gin Thr Phe Glu Arg Arg Pro Arg Lys Ser Ala Tyr Val Tyr Gly 
385 390 395 400 

Glu lie Ala Arg Ser Lys Glu lie Lys Asp Glu Leu Leu Lys Arg Tyr 

405 410 415 

Gly Leu Pro Glu Leu Gin Leu 
4 2 0 



< 2 1 0 > 3 

< 2 11 > 57 

< 2 12 > DNA 

<213> Artificial Sequence 



< 220 > 

<223> Description of Artificial Sequence:An upper primer designed 
create 

the Ndel site. 



< 4 0 0 > 3 

taagaaggag atatacatat gccgctgaaa ttcccggaaa tgtttctctt tggtacc 



< 2 1 0 > 4 

< 2 1 1 > 4 6 
<212> DNA 

< 213 > Artificial Sequence 



< 2 2 0 > 

34 



* 

<223> Description of Artificial Sequence:A lower primer designed to 
create the BamHI site. 

< 4 0 0 > 4 

tttactgcag agaggatccc taatcctaaa gttgaagttc tggtag 46 
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